SEQUENCE LISTING 




(1) GENERAL INFORMATION: 



(i) 



APPLICANT: Oon, Chong Jin 



Lim, Gek Keow 
Leong, Ai Lin 
Zhao, Yi 



Chen, Wei Ning 



(ii) 



TITLE OF INVENTION: A VACCINE- INDUCED HEPATITIS B VIRAL 
STRAIN AND USES THEREOF 



(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Ladas & Parry 

(B) STREET: 26 West 61 Street 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY : USA 

(F) ZIP: 10023 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : 09/719 , 533 

(B) FILING DATE: 13-DEC-2000 

( C ) C LAS S I F I CAT I ON : 4 3 5 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/SG98/00045 

(B) FILING DATE: 19-JUN-1998 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Mass, Clifford J. 

(B) REGISTRATION NUMBER: 30,086 

(C) REFERENCE/DOCKET NUMBER : U- 013 108 - 9 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (212) 708-1890 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CTCCACCACT TTCCACCAAA CTCTTCAAGA TCCCAGAGTC AGGGCCCTGT ACTTTCCTGC 60 



TGGTGGCTCC AGTTCAGGAA CAGTGAGCCC TGCTCAGAAT ACTGTCTCTG CCATATCGTC 12 0 



AATCTTATCG AAG AC TGGGG ACCCTGTACC GAACATGGAG AACATCGCAT CAGGACTCCT 180 

AGGACCCCTG CTCGTGTTAC AGGCGGGGTT TTTCTTGTTG ACAAAAATCC TCACAATACC 24 0 

GCAGAGTCTA GACTCGTGGT GGACTTCTCT CAATTTTCTA GGGGGAACAC CCGTGTGTCT 3 00 

TGGCCAAAAT TCGCAGTCCC AAATCTCCAG TCACTCACCA ACCTGTTGTC CTCCAATTTG 3 60 

TCCTGGTTAT CGCTGGATGT GTCTGCGGCG TTTTATCATC TTCCTCTGCA TCCTGCTGCT 4 20 

ATGCCTCATC TTCTTGTTGG TTCTTCTGGA CTATCAAGGT ATGTTGCCCG TTTGTCCTCT 4 80 

AATTCCAGGA TCAACAACAA CCAGCACCGG ACCATGCAAA ACCTGCACAA CTCCTGCTCA 54 0 

AGGAACCTCT ATGTTTCCCT CATGTTGCTG TACAAAACCT ACGGACAGAA ACTGCACCTG 6 00 

TATTCCCATC CCATCATCTT GGGCTTTCGC AAAATACCTA TGGGAGTGGG CCTCAGTCCG 660 

TTTCTCTTGG CTCAGTTTAC TAGTGCCATT TGTTCAGTGG TTCGTAGGGC TTTCCCCCAC 72 0 

TGTCTGGCTT TCAGTTATAT GGATGATGTG GTTTTGGGGG CCAAGTCTGT ACAACATCTT 7 80 

GAGTCCCTTT ATGCCGCTGT TACCAATTTT CTTTTGTCTT TGGGTATACA TTTAAACCCT 84 0 

CACAAAACAA AAAGATGGGG ATATTCCCTT AACTTCATGG GATATGTCAT TGGGAGTTGG 90 0 

GGCACATTGC CACAGGAACA TATTGTACAA AAAATCAAAA TGTGTTTTAG GAAACTTCCT 960 

GTAAACAGGC CTATTGATTG GAAAGTATGT CAACGAATTG TGGGTCTTTT GGGGTTTGCC 102 0 

GCCCCTTTCA CGCAATGTGG ATATCCTGCT TTAATGCCTT TATATGCATG TAT AC AAG C A 10 8 0 

AAACAGGCTT TTACTTTCTC GCAAACTTAC AAGACCTTTC TAAGTAAACA GTATCTGAAC 114 0 

CTTTACCCCG TTGCTCGGCA ACGCCCTGGT CTGTGCCAAG TGTTTGCTGA CGCAACCCCC 12 0 0 

ACTGGTTGGG GCTTGGCCAT AGGCCATCAG CGCATGCGTG GAACCTTTGT GTCTCCTCTG 12 6 0 

CCGATCCATA CTGCGGAACT CCTAGCCGCT TGTTTTGCTC GCAGCAGGTC TGGGG C AAAA 132 0 

CTCATCGGGA CTGACAATTC TGTCGTGCTC TCCCGCAAGT ATACATCATT TCCATGGCTG 13 80 

CTAGGCTGTG CTGCCAACTG GATCCTGCGC GGGACGTCCT TTGTTTACGT CCCGTCGGCG 14 4 0 

CTGAATCCCG CGGACGACCC CTCCCGGGGC CGCTTGGGGC TCTACCGCCC GCTTCTCCGC 15 00 

CTGTTATACC GACCGACCAC GGGGCGCACC TCTCTTTACG CGGACTCCCC GTCTGTGCCT 1560 

TCTCATCTGC CGGACCGTGT GCACTTCGCT TCACCTCTGC ACGTCGCATG GAGACCACCG 162 0 

TGAACGCCCA CGGGAACCTG CCCAAGGTCT TGCATAAGAG GACTCTTGGA CTTTCAGCAA 1680 

TGTCAACGAC CGACCTTGAG GCATACTTCA AAGACTGTGT GTTTAATGAG TGGGAGGAGT 174 0 

TGGGGGAGGA GGTTAGGTTA AAGGTCTTTG TACTAGGAGG CTGTAGG CAT AAATTGGTGT 180 0 

GTTCACCATC ACCATGCAAC TTTTTCACCT CTGCCTAATC ATCTCATGTT CATGTCCTAC 1860 

TGTTCAAGCC TCCAAGCTGT GCCTTGGGTG GCTTTGGGGC ATGGACATTG ACCCGTATAA 192 0 



AGAATTTGGA GCTTCTGTGG AGTTACTCTC TTTTTTGCCT TCTGACTTTT TTCCTTCTAT 198 0 

TCGAGATCTC CTCGACACCG CCTCTGCTCT GTATCGGGAG GCCTTAGAGT CTCCGGAACA 2 04 0 

TTGTTCACCT C AC C ATACGG CACTCAGGCA AGCTATTCTG AGTTGGGGTG AGTTAATGAA 2100 

TCTAGCCACC TGGGTGGGAA GTAATTTGGA AGATCCAGCA TCCAGGGAAT TAGTAGTCAG 2160 

CTATGTCAAC GTTAATATGG GCCTAAAAAT CAGACAACTA TTGTGGTTTC ACATTTCCTG 222 0 

TCTTACTTTT GGGAGAGAAA CTGTTCTTGA ATATTTGGTG TCTTTTGGAG TGTGGATTCG 22 8 0 

CACTCCTCCT G CAT AT AG AC CACCAAATGC CCCTATCTTA TCAACACTTC CGGAAACTAC 2 34 0 

TGTTGTTAGA CGAAGAGGCA GGTCCCCTAG AAGAAGAACT CCCTCGCCTC GCAGACGAAG 24 00 

GTCTCAATCG CCGCGTCGCA GAAGATCTCA ATCTCGGGAA TCTCAATGTT AGTATTCCTT 24 60 

GGACACATAA GGTGGGAAAC TTTACGGGGC TTTATTCTTC TACGGTACCT TGCTTTAATC 2 52 0 

CTAAATGGCA AACTCCTTCT TTTCCGGACA TTCATTTGCA GGAGGACATT CTTGATAGAT 258 0 

GTAAGCAATT TGTGGGGCCC CTTACAGTAA ATGAAAACAG GAGACTAAAA TTAATTATGC 2 64 0 

CTGCTAGGTT TTATCCAAAT GTTACTAAAT ATTTGCCCTT AGATAAAGGG ATCAAACCAT 2 700 

ATTATCCAGA GTATGTAGTT AATCATTACT TCCAGACGCG ACATTATTTA C AC AC TCTTT 2760 

GGAAGGCGGG GATCTTATAT AAAAGAGAGT CCACACGTAG CGCCTCATTT TGCGGGTCAC 2 82 0 

CATATTCTTG GGAACAAGAT CTACAGCATG GGAGGTTGGT CTTCCAAACC TCGAAAAGGC 2880 

ATGGGGACAA ATCTTTCTGT CCCCAATCCC CTGGGATTCT TCCCCGATCA TCAGTTGGAC 2 94 0 

CCTGCATTCA AAGCCAACTC AGAAAATCCA GATTGGGACC TCAACCCGCA CAAGGACAAC 3 00 0 

TGGCCGGACG CCAACAAGGT GGGAGTGGGA GCATTCGGGC CAGGGTTCAC CCCTCCTCAT 3 060 

GGGGGACTGT TGGGGTGGAG CCCTCAGGCT CAGGGCCTAC TCACAACTGT GCCAGCAGCT 312 0 

CCTCCTCCTG CCTCCACCAA TCGGCAGTCA GGAAGGCAGC CTACTCCCTT ATCTCCACCT 318 0 

CTAAGGGACA CTCATCCTCA GGCCATGCAG TGGAA 3 215 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Pro Leu Ser Tyr Gin His Phe Arg Lys Leu Leu Leu Leu Asp Glu 
15 10 15 



Glu Ala Gly Pro 
20 

Leu Asn Arg Arg 
35 

Ser lie Pro Trp 
50 

Ser Thr Val Pro 
65 

Asp lie His Leu 



Glu Pro Leu Thr 
100 

Ala Arg Phe Tyr 
115 

lie Lys Pro Tyr 
130 

Arg His Tyr Leu 
145 

Glu Ser Thr Arg 



Gin Asp Leu Gin 
180 

Gly Asp Lys Ser 
195 

Ser Val Gly Pro 
210 

Pro Gin Pro Ala 
225 

Gly Ser lie Arg 



Val Glu Pro Ser 
260 

Ser Ser Cys Leu 
275 

lie Ser Thr Ser 
290 

His His Phe Pro 
305 

Leu Ser Cys Trp 



Leu Glu Glu Glu 



Val Ala Glu Asp 
40 

Thr His Lys Val 
55 

Cys Phe Asn Pro 
70 

Gin Glu Asp lie 
85 

Val Asn Glu Asn 



Pro Asn Val Thr 
120 

Tyr Pro Glu Tyr 
135 

Hi s Thr Leu Trp 
150 

Ser Ala Ser Phe 
165 

His Gly Arg Leu 



Phe Cys Pro Glu 
200 

Cys lie Gin Ser 
215 

Gin Gly Gin Leu 
230 

Ala Arg Val His 
245 

Gly Ser Gly Pro 



His Gin Ser Ala 
280 

Lys Gly His Ser 
295 

Pro Asn Ser Ser 
310 

Trp Leu Gin Phe 
325 



Leu Pro Arg Leu 
25 

Leu Asn Leu Gly 



Gly Asn Phe Thr 
60 

Lys Trp Gin Thr 
75 

Leu Asp Arg Cys 
90 

Arg Arg Leu Lys 
105 

Lys Tyr Leu Pro 



Val Val Asn His 
140 

Lys Ala Gly lie 
155 

Cys Gly Ser Pro 
170 

Val Phe Gin Thr 
185 

Ser Pro Gly lie 



Gin Leu Arg Lys 
220 

Ala Gly Arg Gin 
235 

Pro Ser Ser Trp 
250 

Thr His Asn Cys 
265 

Val Arg Lys Ala 



Ser Ser Gly His 
300 

Arg Ser Gin Ser 
315 

Arg Asn Ser Glu 
330 



Ala Asp Glu Gly 
30 

Asn Leu Asn Val 
45 

Gly Leu Tyr Ser 



Pro Ser Phe Pro 
80 

Lys Gin Phe Val 
95 

Leu lie Met Pro 
110 

Leu Asp Lys Gly 
125 

Tyr Phe Gin Thr 



Leu Tyr Lys Arg 
160 

Tyr Ser Trp Glu 
175 

Ser Lys Arg His 
190 

Leu Pro Arg Ser 
205 

Ser Arg Leu Gly 



Gin Gly Gly Ser 
240 

Gly Thr Val Gly 
255 

Ala Ser Ser Ser 
270 

Ala Tyr Ser Leu 
285 

Ala Val Glu Leu 



Gin Gly Pro Val 
320 

Pro Cys Ser Glu 
335 



Tyr Cys Leu Cys 
340 

Thr Glu His Gly 
355 

Val Thr Gly Gly 
370 

Glu Ser Arg Leu 
385 

Arg Val Ser Trp 



Asn Leu Leu Ser 
420 

Ala Phe Tyr His 
435 

Val Gly Ser Ser 
450 

Ser Arg lie Asn 
465 

Ser Cys Ser Arg 



Tyr Gly Gin Lys 
500 

Arg Lys lie Pro 
515 

Phe Thr Ser Ala 
530 

Leu Ala Phe Ser 
545 

Gin His Leu Glu 



Leu Gly lie His 
580 

Leu Asn Phe Met 
595 



Glu His lie Val 
610 

Asn Arg Pro lie 
625 

Gly Phe Ala Ala 



His lie Val Asn 



Glu His Arg lie 
360 

Val Phe Leu Val 
375 

Val Val Asp Phe 
390 

Pro Lys Phe Ala 
405 

Ser Asn Leu Ser 



Leu Pro Leu His 
440 

Gly Leu Ser Arg 
455 

Asn Asn Glu His 
470 

Asn Leu Tyr Val 
485 

Leu His Leu Tyr 



Met Gly Val Gly 
520 

lie Cys Ser Val 
535 

Tyr Met Asp Asp 
550 

Ser Leu Tyr Ala 
565 

Leu Asn Pro His 



Gly Tyr Val lie 
600 



Gin Lys lie Lys 
615 

Asp Trp Lys Val 
630 

Pro Phe Thr. Gin 
645 



Leu lie Glu Asp 
345 

Arg Thr Pro Arg 



Asp Lys Asn Pro 
380 

Ser Gin Phe Ser 
395 

Val Pro Asn Leu 
410 

Trp Leu Ser Leu 
425 

Pro Ala Ala Met 



Tyr Val Ala Arg 
460 

Arg Thr Met Glu 
475 

Ser Leu Met Leu 
490 

Ser His Pro lie 
505 

Leu Ser Pro Phe 



Val Arg Arg Ala 
540 

Val Val Leu Gly 
555 

Ala Val Thr Asn 
570 

Lys Thr Lys Arg 
585 

Gly Ser Trp Gly 



Met Cys Phe Arg 
620 

Cys Gin Arg lie 
635 

Cys Gly Tyr Pro 
650 



Trp Gly Pro Cys 
350 

Thr Pro Ala Arg 
365 

His Asn Thr Ala 



Arg Gly Asn Thr 
400 

Gin Ser Leu Thr 
415 

Asp Val Ser Ala 
430 

Pro His Leu Leu 
445 

Leu Ser Ser Asn 



Asn Leu His Asn 
480 

Leu Tyr Lys Thr 
495 

lie Leu Gly Phe 
510 

Leu Leu Ala Gin 
525 

Phe Pro His Cys 



Ala Lys Ser Val 
560 

Phe Leu Leu Ser 
575 

Trp Gly Tyr Ser 
590 

Thr Leu Pro Gin 
605 



Lys Leu Pro Val 



Val Gly Leu Leu 
640 

Ala Leu Met Pro 
655 



Leu Tyr Ala Cys lie Gin Ala Lys Gin Ala Phe Thr Phe Ser Gin Thr 
660 665 670 



Tyr Lys Thr Phe Leu Ser Lys Gin Tyr Leu Asn Leu Tyr Pro Val Ala 
675 680 685 

Arg Gin Arg Pro Gly Leu Cys Glu Val Phe Ala Asp Ala Thr Pro Thr 
690 695 700 

Gly Trp Gly Leu Ala lie Gly His Gin Arg Met Arg Gly Thr Phe Val 
705 710 715 720 

Ser Pro Leu Pro lie His Thr Ala Glu Leu Leu Ala Ala Cys Phe Ala 
725 730 735 

Arg Ser Arg Ser Gly Ala Lys Leu lie Gly Thr Asp Asn Ser Val Val 
740 745 750 

Leu Ser Arg Lys Tyr Thr Ser Phe Pro Trp Leu Leu Gly Cys Ala Ala 
755 760 765 

Asn Trp lie Leu Arg Gly Thr Ser Phe Val Tyr Val Pro Ser Ala Leu 
770 775 780 

Asn Pro Ala Asp Asp Pro Ser Arg Gly Arg Leu Gly Leu Tyr Arg Pro 
785 790 795 800 

Leu Leu Arg Leu Leu Tyr Arg Pro Thr Thr Gly Arg Thr Ser Leu Tyr 
805 810 815 

Ala Asp Ser Pro Ser Val Pro Ser His Leu Pro Asp Arg Val His Phe 
820 825 830 

Ala Ser Pro Leu His Val Ala Trp Arg Pro Pro 
835 840 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Lys Ala Asn Ser Glu Asn Pro Asp Trp Asp Leu Asn Pro His 
35 40 45 



Lys Asp Asn Trp 
50 

Pro Gly Phe Thr 
65 

Ala Gin Gly Leu 



Thr Asn Arg Gin 
100 

Arg Asp Thr His 
115 

Gin Thr Leu Gin 
130 

Gly Ser Ser Ser 
145 

lie Ser Ser lie 



Asn He Ala Ser 
180 

Phe Phe Leu Leu 
195 

Trp Trp Thr Ser 
210 

Gin Asn Ser Gin 
225 

Pro He Cys Pro 



Phe Leu Cys He 
260 

Asp Tyr Gin Gly 
275 

Thr Thr Ser Thr 
290 

Thr Ser Met Phe 
305 

Cys Thr Cys He 



Trp Glu Trp Ala 
340 

Phe Val Gin Trp 
355 



Pro Asp Ala Asn 
55 

Pro Pro His Gly 
70 

Leu Thr Thr Val 
85 

Ser Gly Arg Gin 



Pro Gin Ala Met 
120 

Asp Pro Arg Val 
135 

Gly Thr Val Ser 
150 

Leu Ser Lys Thr 
165 

Gly Leu Leu Gly 



Thr Lys He Leu 
200 

Leu Asn Phe Leu 
215 

Ser Gin He Ser 
230 

Gly Tyr Arg Trp 
245 

Leu Leu Leu Cys 



Met Leu Pro Val 
280 

Gly Pro Cys Lys 
295 

Pro Ser Cys Cys 
310 

Pro He Pro Ser 
325 

Ser Val Arg Phe 



Phe Val Gly Leu 
360 



Lys Val Gly Val 
60 

Gly Leu Leu Gly 
75 

Pro Ala Ala Pro 
90 

Pro Thr Pro Leu 
105 

Gin Trp Asn Ser 



Arg Ala Leu Tyr 
140 

Pro Ala Gin Asn 
155 

Gly Asp Pro Val 
170 

Pro Leu Leu Val 
185 

Thr He Pro Gin 



Gly Gly Thr Pro 
220 

Ser His Ser Pro 
235 

Met Cys Leu Arg 
250 

Leu He Phe Leu 
265 

Cys Pro Leu He 



Thr Cys Thr Thr 
300 

Cys Thr Lys Pro 
315 

Ser Trp Ala Phe 
330 

Ser Trp Leu Ser 
345 

Ser Pro Thr Val 



Gly Ala Phe Gly 



Trp Ser Pro Gin 
80 

Pro Pro Ala Ser 
95 

Ser Pro Pro Leu 
110 

Thr Thr Phe His 
125 

Phe Pro Ala Gly 



Thr Val Ser Ala 
160 

Pro Asn Met Glu 
175 

Leu Gin Ala Gly 
190 

Ser Leu Asp Ser 
205 

Val Cys Leu Gly 



Thr Cys Cys Pro 
240 

Arg Phe He He 
255 

Leu Val Leu Leu 
270 

Pro Gly Ser Thr 
285 

Pro Ala Gin Gly 



Thr Asp Arg Asn 
320 

Ala Lys Tyr Leu 
335 

Leu Leu Val Pro 
350 

Trp Leu Ser Val 
365 



lie Trp Met Met Trp Phe Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser 
370 375 380 

Pro Phe Met Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr lie 

385 390 395 400 



INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Gin Leu Phe His Leu Cys Leu lie lie Ser Cys Ser Cys Pro Thr 
15 10 15 

Val Gin Ala Ser Lys Leu Cys Leu Gly Trp Leu Trp Gly Met Asp lie 
20 25 30 

Asp Pro Tyr Lys Glu Phe Gly Ala Ser Val Glu Leu Leu Ser Phe Leu 
35 40 45 

Pro Ser Asp Phe Phe Pro Ser lie Arg Asp Leu Leu Asp Thr Ala Ser 
50 55 60 

Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His 
65 70 75 80 

His Thr Ala Leu Arg Gin Ala lie Leu Ser Trp Gly Glu Leu Met Asn 
85 90 95 

Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala Ser Arg Glu 
100 105 110 

Leu Val Val Ser Tyr Val Asn Val Asn Met Gly Leu Lys lie Arg Gin 
115 120 125 

Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val 
130 135 140 

Leu Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala 
145 150 155 160 

Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro Glu Thr Thr 
165 170 175 

Val Val Arg Arg Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser Pro 
180 185 190 

Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg 
195 200 205 



Glu Ser Gin Cys 
210 



(2) INFORMATION FOR SEQ ID NO : 5 : 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Ala Ala Arg Leu Cys Cys Gin Leu Asp Pro Ala Arg Asp Val Leu 
15 10 15 

Cys Leu Arg Pro Val Gly Ala Glu Ser Arg Gly Arg Pro Leu Pro Gly 
20 25 30 

Pro Leu Gly Ala Leu Pro Pro Ala Ser Pro Pro Val lie Pro Thr Asp 
35 40 45 

His Gly Ala His Leu Ser Leu Arg Gly Leu Pro Val Cys Ala Phe Ser 
50 55 60 

Ser Ala Gly Pro Cys Ala Leu Arg Phe Thr Ser Ala Arg Arg Met Glu 
65 70 75 80 

Thr Thr Val Asn Ala His Gly Asn Leu Pro Lys Val Leu His Lys Arg 
85 90 95 

Thr Leu Gly Leu Ser Ala Met Ser Thr Thr Asp Leu Glu Ala Tyr Phe 
100 105 110 

Lys Asp Cys Val Phe Asn Glu Trp Glu Glu Leu Gly Glu Glu Val Arg 
115 120 125 

Leu Lys Val Phe Val Leu Gly Gly Cys Arg His Lys Leu Val Cys Ser 
130 135 140 

Pro Ser Pro Cys Asn Phe Phe Thr Ser Ala 
145 150 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
ATAAGCTTATG CCCCTATCTT ATCAACACTT CCGGA 



(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
GAGTCTAGAC TCTGCGGTAT TGTGA 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 
GAGTCTAGAC TCGTGGTGGA CTTCT 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 
TGAGAATTCT CACGGTGGTC TCCATGCGAC GT 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
TTTGTTTACG TCCCGT 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
CTAAGCTTAG TTTCCGGAAG TGTTGAT 



